# AREA EFFICIENT FIXED-WIDTH BOOTH MULTIPLIERS WITH HIGH ACCURACY

SUJEETHA.G, AARTHI.C

DEPARTMENT OF M.E VLSI DESIGN, ASSISTANT PROFESSOR DEPARTMENT OF ECE SENGUNTHAR ENGINEERING COLLEGE, SENGUNTHAR ENGINEERING COLLEGE TIRUCHENGODE, INDIA.

sujeethag@rediffmail.com

ABSTRACT-In this project a single compensation formula of adaptive conditional-probability estimator (ACPE) applied to fixed-width Booth multiplier is proposed. Based on the conditional- probability theory, the ACPE can be easily applied to large length Booth multipliers (such as 32-bit or larger) for achieving a higher accuracy performance. To consider the trade-off between accuracy and area cost, the ACPE provides varying column information w to adjust the accuracy with respect to system requirements.

The 16-bit ACPE Booth multiplier with w = 3 reduces 28.9% silicon area with only 0.39 dB signal-to-noise ratio (SNR) loss when compared with post-truncated (P-T) Booth multiplier. Furthermore, the ACPE Booth multipliers are applied to two-dimensional (2-D) discrete cosine transform (DCT) to evaluate the system performance. Implemented in a TSMC 0.18 m CMOS process, the DCT core with ACPE (w = 3) can save 14.3% area cost with only 0.48 dB peak-signal-to-noise-ratio (PSNR) penalty compared to P-T method. According to the applications of larger bit width fixed-width multipliers, like pseudo random number generator (PRNG), huge number of time will be consumed to establish the compensation function based on exhaustive simulation. Therefore, a compensation function is presented in binary-sign-digit (BSD) Booth multipliers by using condition-probability method.

The conditional bits in are not chosen well. Thus, the compensation circuits cannot improve the performance in accuracy when compared with existing works. The probabilistic estimation bias is presented to replace the time-consuming exhaustive simulation methods and it also exhibits good accuracy performance.

INDEX TERMS-Adaptive conditional-Probability estimator(ACPE), Booth multiplier, Discrete cosine transform (DCT),fixed-width.

#### 1.INTRODUCTION

FIXED-WIDTH multipliers are the important components in digital signal processing (DSP) systems . In general, they truncate the least significant half part directly to generate the output with the same word length as input, and the computation errors occur from the operators that perform the truncation. Therefore, many compensation methods are presented to reduce the truncation errors in Baugh-Wooley array multipliers and Booth multipliers.

Booth multipliers are popular due to the reduced elements of the partial products. The fixedwidth Booth multiplier with the best accuracy is the post-truncated (P-T) multiplier, which uses rounding off operator after calculating all products. However, the P-T multiplier consumes large silicon area in very large scale integration (VLSI) designs. In order to reduce the area cost, the direct-truncated (D-T) multiplier chops the least significant half partial products directly. Recently, many works use more information from Booth encoder and partial products to achieve higher accuracy performance. In , taking more information provided by Booth encoder, the compensation circuits can reduce the truncation errors.Wang *et al.* further improve the work of truncation errors can be alleviated evidently. Nevertheless, the area cost is increased from the extra information of compensation circuits.

There is a trade-off between accuracy and area cost. Song *et al.* introduce the column information to provide more choices between accuracy and area cost . The compensation function is also established by the exhaustive simulation to adjust the accuracy with respect to system requirements based on varying . In this paper, a high-accuracy adaptive conditionalprobability estimator (ACPE) is proposed to be applied in fixed-width Booth multipliers.

Instead of the time-consuming exhaustive simulation of the previous works, the ACPE is derived from the conditional-probability theory.

Thus, the ACPE can be easily applied to large length Booth multipliers (such as 32-bit or larger).

#### **1.1 BOOTH'S MULTIPLIER:**

Booth multiplier will multiply a\*b where a is multiplicand and b is multiplier. The key to Booth's insight is to divide the groups bit of multiplier into 3 parts: the beginning, the middle, or the end of a run of 1s.

In general, truncate the least significant half part directly to generate the output with the same word length as input, and the computation errors occur from the operators that perform the truncation. Booth multipliers are popular due to the reduced elements of the partial products. In this paper, a highaccuracy adaptive conditional-probability estimator (ACPE) is proposed to be applied in fixed-width Booth multipliers. Thus, the ACPE can be easily applied to large length Booth multipliers (such as 32bit or larger). Requirements, the ACPE also provide varying column information (W) to adjust the system accuracy. Furthermore, the ACPE Booth multipliers are also applied to a two-dimensional (2-D) discrete cosine transform (DCT) to demonstrate the performance of real applications.

#### **1.2 SCOPE OF THE PROJECT:**

In this project, a high-accuracy adaptive conditional-probability estimator (ACPE) is proposed to be applied in fixed-width Booth multipliers. Instead of the time-consuming exhaustive simulation of the previous works, the ACPE is derived from the conditional-probability theory. Thus, the ACPE can be easily applied to large length Booth multipliers (such as 32-bit or larger). As a result, the ACPE can reduce 14.3% area cost compared to P-T method with only 0.48 dB peak-signal-to-noise-ratio (PSNR) lost. Consequently, the proposed ACPE saves the establishment time of compensation circuit, provides varying column information, and achieves the high accuracy performance in a single compensation function.



Fig 1.2 Grouping Bits of Booth Encoder

|    |      |      |      |      |        |      |      | a7   | a6   | a5   | a4   | a3   | a2   | al   | a0   |
|----|------|------|------|------|--------|------|------|------|------|------|------|------|------|------|------|
|    |      |      |      |      |        | Х    |      | b7   | b6   | b5   | b4   | b3   | b2   | b1   | b0   |
|    |      |      |      |      |        | _    | P0_8 | P0_7 | P0_6 | P0_5 | P0_4 | P0_3 | P0_2 | P0_1 | P0_0 |
|    |      |      |      |      | P1_8   | P1_7 | P1_6 | P1_5 | P1_4 | P1_3 | P1_2 | P1_1 | P1_0 |      | n0   |
|    |      |      | P2_8 | P2_3 | 7 P2_6 | P2_5 | P2_4 | P2_3 | P2_2 | P2_1 | P2_0 |      | n1   |      |      |
|    | P3_8 | P3_7 | P3_6 | P3_5 | 5 P3_4 | P3_3 | P3_2 | P3_1 | P3_0 |      | n2   |      |      |      |      |
|    |      |      |      |      |        |      |      |      | n3   |      |      |      |      |      |      |
| 15 | p14  | p13  | p12  | p11  | p10    | p9   | p8   | p7   | рб   | p5   | p4   | p3   | p2   | p1   | рØ   |
|    |      |      |      |      |        |      |      |      |      |      |      |      |      |      |      |

Fig. 1. 1.8-bit Booth Multiplier

II. FIXED-WIDTH MODIFIED BOOTH MULTIPLIER

Modified Booth encoding is popular for reducing the number of partial products ]. The -bit product can be expressed in two's complement representation as follows: (1) Three concatenated inputs , , and can be mapped into by using Booth encoder, as tabulated in Table I, where nonzero code is a 1-bit digit whose value is determined by whether equals to zero. There are rows in partial product array producing by Booth encoding, where is an even number.

p)



According to the mapping Tables , Booth encoder will generate the partial products.

| $y_{2i+1}$ | $y_{2i}$ | $y_{2i-1}$ | $y_i'$ | $nz_i$ |
|------------|----------|------------|--------|--------|
| 0          | 0        | 0          | 0      | 0      |
| 0          | 0        | 1          | 1      | 1      |
| 0          | 1        | 0          | 1      | 1      |
| 0          | 1        | 1          | 2      | 1      |
| 1          | 0        | 0          | -2     | 1      |
| 1          | 0        | 1          | -1     | 1      |
| 1          | 1        | 0          | -1     | 1      |
| 1          | 1        | 1          | 0      | 0      |

# TABLE 1.1 Modified booth Encoder

| $n_{Q-1}$ | $p_{0,Q-1}$ | $p_{L,0}$ | s2 | $s_1$ | $s_0$ | λ | $e_{Q-1}$ |
|-----------|-------------|-----------|----|-------|-------|---|-----------|
| 0         | 0           | 0         | 1  | 0     | 0     | 1 | 0         |
| 0         | 0           | 1         | 0  | 1     | 1     | 1 | 0         |
| 0         | 1           | 0         | 1  | 0     | 0     | 1 | 1         |
| 0         | 1           | 1         | 0  | 1     | 1     | 1 | 1         |
| 1         | 0           | 0         | 1  | 0     | 0     | 1 | 1         |
| 1         | 0           | 1         | 0  | 1     | 1     | 1 | 1         |
| 1         | 1           | 0         | 1  | 0     | 1     | 0 | 0         |
| 1         | 1           | 1         | 1  | 0     | 0     | 0 | 0         |

TABLE 1.2 Modified booth Encoder partial products

The partial product array also can be divided into two parts: the main part (MP) including the most significant 'L' columns, and the truncation part (TP) including the least significant 'L' columns. Besides, the column information "w" is included to adjust accuracy with respect to system requirements, and 'W' means that most significant columns (MSCs) are calculated and the MSC is chosen to estimate the compensation values.

#### fig.1.3Ttruncated parts of lambda=1,2,3.



## 2.ALGORITHM:

#### ADAPTIVE CONDITIONAL-ROBABILITY ESTIMATOR FOR FIXED-WIDTH BOOTH MULTIPLIER

when the multiplier and multiplicand enters the booth encoder according to the table the partial products were computed ad sent to csa tree and compensation circuit and finally all the partial sum and carry products were added at parallel prefix adder unit and pq will be the quantized product

Modified Booth encoding is popular for reducing the number of partial products. The 2L -bit product P can be expressed in two's complement representation as follows:

$$X = -x_{L-1}2^{L-1} + Y = -y_{L-1}2^{L-1} + P = X x Y$$



fig 1.4 PROPOSED BLOCK DIAGRAM

Three concatenated inputs  $y_{2i+1}$ ,  $y_{2i}$ , and  $y_{2i-1}$  can be mapped  $y_{i'}$  into by using Booth encoder, as tabulated in Table I, where nonzero code is a 1-bit digit whose value is determined by whether  $y_{i'}$  equals to zero. There are Q=L/2 rows in partial product array producing by Booth encoding, where L is an even number.

> {s2,s1,s0, $\lambda$ ,e<sub>Q-1</sub>}={p<sub>L,0</sub>,p<sub>L,0</sub>,p<sub>L,0</sub>,1,p<sub>0</sub>,Q-1}+{0,0,0,0,n<sub>O-1</sub>}

The partial product array in Fig. 1(b) also can be divided into two parts: the main part (MP) including the most significant L columns, and the truncation part (TP) including the least significant L columns. Besides, the column information  $\omega$  is included to adjust accuracy with respect to system requirements, and  $\omega$  means that L+  $\omega$  most significant columns (MSCs) are calculated and the (L+  $\omega$ +1)<sup>th</sup> MSC is chosen to estimate the compensation values. Therefore, L+  $\omega$ +1 MSCs are used to produce the results. Taking  $\omega$ =2 as an example, the results are calculated by the partial products of most significant L+3(=L+w+1) columns.

When fixed-width multiplication is concern, the quantized product P q can be expressed as follows:

$$P \approx P_q = MP + TP$$
  
 $MP + \sigma 2^L$ .....(a)

Where  $\sigma$  represents the bias of adaptive conditionalprobability estimator (ACPE) which can be decomposed further into TP<sub>major</sub> and TP<sub>minor</sub> parts as following equations.

$$\sigma = [TP_{minor} + TP_{major}]$$
$$= [TP_{minor} + 2^{-\omega}] \dots (b)$$

Compensation is planning for side effects or other unintended issues in a design. The design of an

invention can itself also be to compensate for some other existing issue or exception.

The compensation circuit is implemented to compute compensation bias based on



Fig1.5: Compensation circuits of ACPE Booth multipliers for w=1, 2, 3. (a) w=1. (b) w=2. (c) w=3.

#### 2.1 COMPENSATION CIRCUIT:

Compensation is planning for side effects or other unintended issues in a design. The design of an invention can itself also be to compensate for some other existing issue or exception.

The compensation circuit is implemented to compute compensation bias based on

$$= [TP_{major} + TP_{minor}]$$
$$= [TP_{major} + 2^{-w}_{w}]$$

#### 2.2 CSA TREE:

The carry-save adder generally consists of two ripple carry adders and a multiplexer. Adding two n-bit numbers with a carry-select adder is done with two adders (therefore two ripple carry adders) in order to perform the calculation twice, one time with the assumption of the carry being zero and the other assuming one. After the two results are calculated, the correct sum, as well as the correct carry, is then selected with the multiplexer once the correct carry is known.

The CSA tree and Brent-Kung parallelprefix adder sum up the partial products of MP and based on

# $PP_q=MP+TP$ =MP+.2<sup>L</sup>



Fig1.6.The proposed 64-bit ACPE compensation circuit with w=4

#### **3.RESULTS**

# SIMULATION REPORTS PARTITION OF TP w=1



#### ROW2:



## ROW3:



#### ROW4:



#### ROW5:



#### **OVER ALL PARTIAL PRODUCTS:**



#### 3.1.2 PARTITION OF TP w=2:



#### ROW2:



#### ROW3:



#### ROW4:



#### ROW5:



#### **OVER ALL PARTIAL PRODUCTS:**



# **3.2 COMPENSATION CIRCUIT**

#### 3.2.1 COMPENSATION W=1:



3.2.2 COMPENSATION w=2:



#### **4.CONCLUSION**

This paper proposes an adaptive conditionalprobability estimator (ACPE) in fixed-width Booth multipliers. Without heuristic method, the ACPE is derived from conditional-probability theory, which can be easily applied to large length Booth multipliers for achieving higher accuracy performance. The column information is also induced in ACPE to adjust the accuracy with respect to system requirements. For the 2-D DCT implementation with the ACPE Booth multipliers, an excellent performance is exhibited in terms of PSNR and area cost. As a result, the proposed ACPE provides a flexible and high accuracy compensation circuit applied to fixed-width Booth multipliers.

|                         | Characteristic |                           |  |  |  |
|-------------------------|----------------|---------------------------|--|--|--|
| Shift-Register<br>Array | Technology     | 0.18 $\mu$ m 1P6M         |  |  |  |
|                         | Supply power   | 1.8V                      |  |  |  |
| Kernel                  | Core size      | 509 $\mu$ m x 518 $\mu$ m |  |  |  |
|                         | Gate Count     | 18 K                      |  |  |  |
| M1 M2 M3 M4             | Max Freq.      | 55 MHz                    |  |  |  |
|                         | Power          | 17.6 mW @55MHz            |  |  |  |

FIG.Core layout and characteristics of the proposed ACPE with  $\hfill\square$  \_ applied in DCT core.



FIG. The conditional expected values of TP.

#### **5. REFERENCES**

[1] J. M. Jou, S. R. Kuang, and R. D. Chen, "Design of low-error fixed width multipliers for DSP applications," IEEE Trans. Circuits Syst. II, Analog Digit. Signal Process., vol. 46, no. 6, pp. 836–842, Jun. 1999.

[2] L. D. Van, S. S. Wang, and W. S. Feng, "Design of the lower error fixed-width multiplier and its application," IEEE Trans. Circuits Syst. II, Analog Digit. Signal Process., vol. 47, no. 10, pp. 1112– 1118, Oct. 2000.

[3] A. G. M. Strollo, N. Petra, and D. DeCaro, "Dualtree error compensation for high performance fixedwidth multipliers," IEEE Trans. Circuits Syst. II, Exp. Briefs, vol. 52, no. 8, pp. 501–507, Aug. 2005.

[4] C. H. Chang and R. K. Satzoda, "A low error and high performance multiplexer-based truncated multiplier," IEEE Trans. Very Large Scale Integr. (VLSI) Syst., vol. 18, no. 12, pp. 1767–1771, Dec. 2010.

[5] C. Y. Li, Y. H. Chen, T. Y. Chang, L. Y. Deng, and K. W. To, "Period extension and randomness enhancement using high-throughput reseedingmixing PRNG," IEEE Trans. Very Large Scale Integr. (VLSI) Syst., 2011, to be published.

[6] H. Y. Lee and I. C. Park, "Balanced binary-tree decomposition for area efficient pipelined FFT processing," IEEE Trans. Circuits Syst. I, Reg. Projects, vol. 54, no. 4, pp. 889–900, Apr. 2007.

[7] J. H. Tu and L. D. Van, "Power-efficient pipelined reconfigurable fixed width Baugh-Wooley multipliers," IEEE Trans. Comput., vol. 58, no. 10, pp. 1346–1355, Oct. 2009.

[8] K. K. Parhi, VLSI Digital Signal Processing Systems: Design and Implementation. New York: Wiley, 1999.

[9] L. D. Van and C. C. Yang, "Generalized lowerror area-efficient fixed width multipliers," IEEE Trans. Circuits Syst. I, Reg. Papers, vol. 52, no. 8, pp. 1608–1619, Aug. 2005.

[10] N. Petra, D. D. Caro, V. Garofalo, E. Napoli, and A. G. M. Strollo, "Design of fixed-width multipliers with linear compensation function," IEEE Trans. Circuits Syst. I, Reg. Papers, vol. 58, no. 5, pp. 947–960,May 2011.

[11] N. Petra, D. D. Caro, V. Garofalo, E. Napoli, and A. G. M. Strollo, "Truncated binary multipliers with variable correction and minimum mean square error," IEEE Trans. Circuits Syst. I, Reg. Papers, vol. 57, no. 6, pp. 1312–1325, Jun. 2010.

[12] P. Bougas, P. Kalivas, A. Tsirikos, and K. Z. Pekmestzi, "Pipelined array-based FIR filter folding," IEEE Trans. Circuits Syst. I, Reg. Projects, vol. 52, no. 1, pp. 108–118, Jan. 2005.

[13] S. C. Hsia and S. H.Wang, "Shift-register-based data transposition for cost-effective discrete cosine transform," IEEE Trans. Very Large Scale Integer. (VLSI) Syst., vol. 15, no. 6, pp. 725–728, Jun. 2007.

[14] S. R. Kuang and J. P. Wang, "Low-error configurable truncated multipliers for multiply-accumulate applications," Electron. Letts., vol. 42,no. 16, pp. 904–905, Mar. 2006.

[15] Yuan-Ho Chen, *Student Member, IEEE*, and Tsin-Yuan Chang, *Member, IEEE* "A High-Accuracy Adaptive Conditional-ProbabilityEstimator for Fixed-Width Booth Multipliers, "Yuan-Ho Chen, *Student Member, IEEE*, and Tsin-Yuan Chang, *Member, IEEE*, VOL. 59,NO. 3,MAR 2012